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ROBERT  F.  ENGLE 
NUMBER  76  AUGUST  1971 


THE  SPECIFICATION  OF  THE  DISTURBANCE 
FOR  EFFICIENT  ESTIMATION  * 


I.   Introduction 

It  was  early  recognized  that  estimating  the  model 

1)    y  =  XB  +  £ 
Ee  =  0 
Eee'  =  aft 

by  OLS  was  generally  inefficient  unless  Q   ■  I.   If  ft  is  known  a  priori, 

then  Aitken's  estimator  is  easily  computed  and  enjoys  the  best  linear 

unbiased  property.   In  general,  however,  R  is  not  known  and 

Aitken's  estimator  is  not  estimable  from  the  data  since  there  are 

(T+1)T 
■» — ~ —  additional  parameters  to  estimate  and  only  T  observations. 

Several  approximations  to  Aitken's  estimator  are  possible.   One 

is  to  make  a  guess  as  to  the  correct  covariance  structure  while  the 

other  is  to  assume  that  there  are  only  a  few  parameters  in  fi  and 

to  estimate  them  from  the  data.   The  latter  approach  appears  most 

reasonable  and  has  been  widely  used  by  econometricians  without 

any  careful  study  of  the  rationale  for  particular  paraoeterizations 

or  the  consequences  of  a  bad  choice.   One  exception  to  this  void 

is  a  paper  by  Wallis  [ 7  ]   in  which  he  chooses  a  fourth  order 

autoregressive  process  to  describe  the  disturbance  in  a  quarterly 

model  since  seasonal  terms  must  surely  be  included  in  an  economic  model. 

Other  than  Wallis'  paper  most  authors  have  merely  chosen  a  first  order 

* 
The  author  is  indebted  to  Robert  Hall,  M.I.T.,  for  helpful 

conversations  on  this  material. 
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markov  process  to  approximate  the  disturbance  on  the  implicit 
assumption  that  some  adjustment  is  better  than  none.  The 
purpose  of  this  paper  is  to  examine  that  assumption.   In  Section  II, 
we  reproduce  some  broad  efficiency  bounds  developed  by  Watson. 
In  Section  III,  we  seek  finer  bounds  for  this  specific  comparison 
and  apply  these  to  the  first  order  markov  model  in  a  true  second 
order  world.   In  Section  IV,  we  investigate  numerically,  the 
typical  efficiencies  under  various  assumptions  and  in  Section  V, 
we  present  the  extracted  conclusions. 

II.   The  Watson  Bounds 


The  ordinary  least  squares  (OLS)  estimator  of  model  (1) 
has  variance  covariance  matrix 

2)  Var  bQLS  =  (X'X)"1  X'ftXCX'X)"1  . 

If  however  a  covariance  matrix  H  has  been  assumed,  then  the  estimator 
will  be 

3)  bu  =  (X'H'-'-X)"1  X'H_1Y 

n 

which  has  variance  covariance  matrix: 

4)  var  bu  -  (X'H-1*)-1  X,H~1f2H"1X(X,H"1X)"1  . 

a 

It  is  now  our  intention  to  compare  (2)  with  (4)  for  various  choices 
of  H,  J2  and  X. 

The  matrices  H  and  ft  have  the  fortunate  property  that  since 
they  result  from  covariance  stationary  processes,  their  eigenvectors  are  approximately 
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identical  with  eigenvalues  which  are  the  values  of  the  spectral 
density  functions  evaluated  at  multiples  of   2tt/   .   The  approximation 
is  exact  either  if  the  processes  are  defined  as  circulants  (See  Hall  [  5  ]) 
or  as  the  sample  size  becomes  large.   Specifically,  we  can  follow 
Fishman  [  1  ,  p.  212]  constructing  a  unitary  matrix  U,  where 

[U]M  -  — i—  exp/^  (k-n)(j-n)\  ne  [  T/2  ] 

which,  for  only  one  exogenous  variable,  reduces  (2)  and  (4)  in 
the  limit  to 


-2 


*£y»«)      *j_„ 


6)         Ivar  „        _  I  £    I       f..(6)d6    |  »    1   .  fx<e>£e<9>de 


(*    £     ^ 


ivarbH  fx(8)f-(e)dei  f  (9)f-2(e; 


x        e        h 


respectively  since 


UfiU  =  Diag  f    (9.) 

(UX)*(UX)      =   f    (9   ) 
J  J  x      j 

it 
where  6  =  Tr(j-n)    J  ■  1,  ...  T,   means  conjugate  transpose, 
-1     n 

and  f  (8.)  is  the  spectral  density  of  the  process  z  at  frequency  8  . 
z  J  j 

Asymptotically  the  sums  are  replaced  by  integrals.   From  a 
similar  argument  Aitkan's  estimator  has  asvmototic  variance 
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Thus  the  asymptotic  efficiency  of  b„  relative  to  generalized 

H 
least  square  (GLS)  is 

2 


9)   Effnv"H 


(  f*  f  (8)  f;1(9)deV 

(bu)    .U-TT   X       H       J 


f  *  fx(ej)f"1(6)d6r7T  fx(6)fe(6)f^2(8)d9 


The  subscript  on  Eff  will  be  taken  to  be  the  alternative  estimator 
which  in  this  case  is  GLS  where  the  covariance  matrix  is  assumed 
to  be  J3.   This  expression  has  an  upper  bound  of  1  by  Jensen's 
inequality  and  from  Watson  and  Hannan  [9  ]  and  Watson  [8  ]  we  find 
the  lower  bound  to  be 


10)  Effn(b„) 


>    ,.      _el  k'  H  err 


"n    H    =        VVV/  VV. 


V^^    fH(9lV 

where  9,  and  91  are  the  frequencies  where   e/,   reaches  its  minimum  and 

maximum,       Clearly  if  this  ratio  is  a  constant,  then  the  lower 

f  f fl^ 
bound  is  unity  and  Eff  =  1.  The  lower  bound  is  taken  on  when   x 

fH(8) 

is  non-zero  at  only  two  frequencies  9  and  9  and  at  these 

frequencies,   x/f   is  the  same.   The  upper  bound  is  realized  if  f  , 

fH 

is  constant   everywhere  that  f      is   non-zero.      Thus,    for  a  given  tl 

and  H,    the  actual  efficiency  depends   on  the  spectrum  of   the  exogenous 

variable. 

The  relative  efficiency  of  assuming  H  as  opposed  to  OLS  is  then 
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given  by 

EffQ(V 
">    EffOLS  V  -  ^777-   * 

Letting  L^  and  L    be  the  Watson  lower  bounds  for  the  efficiency 
of  each  of  these  estimators,  we  obtain  the  broad  bounds 

12>     LH  <    EffOLS<V  <   L^T   ' 

Since  our  choice  of  estimator  must  rest  on  whether  this  term  is 

less  than  or  greater  than  one,  this  inequality  is  not  particularly 

helpful.  In  particular,  it  is  very  unlikely  that  there  exists 

an  X  process  which  would  make  either  of  these  bounds  realizable. 

From  (1)  we  see  that  if  f  is  close  to  zero  for  any  frequency, 

LnT _  will  be  very  small  and  if  it  does  so  in  a  wav  that  f„  cannot 
OLo  n 

reproduce,  then  so  will  L„.   Watson  has  calculated  the  values 

n 

of  these  lower  bounds  for  several  situations  and  they  are  remarkable  for 

being  often  close  to  zero.   Clearly,  a  bad  choice  of  H  can  make 

L„  arbitrarily  close  to  zero. 
H 

III.   Efficiency  of  the  First  Order  Markov  Model 

In  order  to  narrow  the  bounds  on  H  relative  to  OLS  we  will  prove 

two  lemmas.   First  let  us  abbreviate    C   £„  (9)f ^(9)d9    by  E[f,(9)] 

Jo 


x  z  z 


f 

J  o 


r  f    (6)d9 
x 


where  it  must  be  remembered  that  there  is  no  random  component;   this 

is  merely  notation.      Then  we  can  write    the  basic   efficiency  definition  as 

13)  EfWV  .     ««0»«2'> 

Ef£(9)f^2(9) 
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Lentma  1:   A  necessary  condition  for  the  lower  bound  of  (13)  to 

be  greater  than  unity  is  that  f  (0)  and  fu  (0)  be  negatively  correlated 

for  all  f  (0).   That  is,  Ef  (0)f~1(0)  -  Ef  (0)Ef~X(0)  <  0  for  any  f  (0) . 
X  £     H  E      n  X 

Proof:   Suppose  there  exists  an  f  (0)  such  that 

Ef/.(0)f"1(0)  >  Ef^(e)EC1(0).   Because  f^(9)>  0, 
£     rl      —    En  E 

E[f£(6)(fy1(0)  -  Ef^(0)  )2]  >  0 

unless  f„  (0)  is  constant,  in  which  case,H  is  identical  with  OLS. 
Squaring,  we  have 

Ef£(9)  f~2(9)  -  2E[f£(0)fH"1(9)]Ef^1(0)  +  Ef£(9)E2f^1(9)  >  0 

and  by  hypothesis  the  middle  terms  can  be  replaced,  giving 

Ef£(9)f~2(9)  -  Ef£(0)  E2f^(0)  >  0 

which  shows  that  for  this  f  (0),  OLS  is  superior  to  H. 

Lemma  2:   For  continuous  functions  u(0),  v(0) ,  if  there  is  a  point 

0-  where  the  slopes  are  the  same  sign,  then  there  exists  a  density 

function  f(9)  which  makes  these  functions  positively  correlated. 

Proof:  Let  f  =|  a   6=8 
'l-a  0  =»  82 
0   otherwise 

Eu(0)v(6)  -  Eu(9)Ev(0)  =  a(l-a)  [uO^-u^)  ]  [v(81)-v(82)  ] 

Let  N(6  )  be  the  neighborhood  of  9  where  the  slopes  are  of  the 

same  sign.   Then  letting  8^  02  e  N(0Q)  gives  the  result. 

By  equating  u(0)  with  f  (9)  and  v(0)  with  f" (8)  this  lemma  with 


lemma  1  establishes  that  a  necessary  conditions  for  the  lower  bound 


to  be  greater  than  one  is  that  f  (8)  and  f~  (9)  mu9t  have  opposite  slopes. 
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We  now  apply  this  result  to  the  assumption  that  the  disturbance 
is  generated  by  a  first  order  raarkov  process.   This  assumption 
means  that  f„(9)  ■  where  p  is  the  estimated  first  order 

H    l   ie  i  2 

1-pe 


serial  correlation  coefficient  of  e  which  asymptotically  is 

15)    p  -  E££-l 

2 
e  e 

This  estimator  will  be  called  autoregressive  least  squares  (ALS) 
and  f.,c  has  the  property  that  it  is  either  always  increasing  or 
always  decreasing  depending  on  whether  p  is  positive  or  negative 
respectively.   For  most  general  processes,    f  (6)  will  change 
slopes  and  so  ALS  is  not  guaranteed  to  be  an  improvement. 

To  be  more  specific,  we  will  assume  that  the  true  disturbance 
is  generated  by  a  second  order  process  and  investigate  the  characteristics 
of  ALS.  Describing  the  second  order  process  by  its  autoregressive  roots, 
y,  and  Yoi  there  are  six  possible  configurations  of  the  power  spectrum  as 
shown  in  figure  1  on  the  following  page.   In  addition,  if  one  of  the 
roots  is  very  small  (roughly  less  than  .1),  then  some  of  these  situations 
look  instead,  just  like  the  first  order  cases  (a)  and  (e) .   We  will  not 
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V  Y2  >  ° 


(b) 


Y,  >0,  Y,  <  0,Yn  +  Y,  >  0 


1    '2 


(d) 


yl   >  0,  Y2  <  0,  YL  +  Y2  <  0 


Yx  =  Y2   ReY  <  0 


(e) 


V  Y2  <    ° 


(f) 


Y;L  =  Y2    ReY  <  0 


Spectra  of  second  order  markov  processes  with  roots  Y, »  Y9« 

FIGURE   1 
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consider  these  cases  as  they  are  clearly  so  close  to  first  order, 

that  there  is  litte  loss  from  assuming  them  to  be  first  order. 

The  estimate  of  p  for  a  second  order  model  is  simply  p  =  (Y-.+Y,,)  /(1+Y,  Y9) 

which  is  positive,  roughly  if  the  overall  slope  of  the  spectrum  is 

negative  and  negative  if  it  slopes  upward. 

From  lemmas  1  and  2  we  see  that  if  the  underlying  model  disturbance 
is  of  types  (b),  (c) ,  (d) ,  or  (f),  then  the  efficiency  of  ALS  relative 
to  OLS  has  a  lower  bound  below  one  and  ALS  is  not  necessarily  superior 
to  OLS.   Only  cases  (a)  and  (e)  remain  for  possible  dominance  of  ALS 
and  from  lemma  3,  we  see  that  these  do  satisfy  the  necessary  condition 
of  lemma  1. 

Lemma  3:   If  u(8)  and  v(8)  are  both  non-decreasing  (increasing)  or 
both  non- increasing  (decreasing)  dif ferentiable  functions  in  the  interior 
of  the  interval,  then  they  will  be  non-negatively  (positively)  correlated  for  any 
density  function.    If  they  move  in  opposite  directions,  the  corresponding 
results  will  hold. 
Proof:  Eu(9)v(8)  -  Eu(6)Ev(6) 

=  E[(u(9)  -  u(X)  )  (v(6)  -  v(X)  )  ] 
where  v(6)  -  v(X)  and  the  expectations  are  taken  to  be  integrals. 
Apply  the  mean  value  theorem 

=  e[u'(9  )  v'o    xe-xn 

* 

which  has  all  elements  greater  than  or  equal  to  zero  since  6 

** 

and  9   must  always  be  in  the  interior  of  the  interval.   We  have 
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established  that  ALS  cannot  dominate  OLS  for  any  second  order  process 

except  that  in  (a)  or  (e)  but  we  have  not  shown  anything  about  cases 

(a)  and  (e) . 

We  next  develop  a  sufficient  condition  for  the  lower  bound 

to  be  greater  than  unity. 

Lemma  4:   A  necessary  and  sufficient  condition  for  the  lower 

bound  to  be  greater  than  unity  is  that  f_  (0)  is  negatively  correlated 
a 

with  f  (OKf"1^)  +  Ef"1(0))  for  all  f  (9). 
E H. n X 

Proof:   By  hypothesis 

Ef£(9)f~2(9)  +  Efe(6)f^1(6)Ef^1(6)  -  Ef^1(e)tEfe(6)f^1(0) 

+  EfeO)Efy1(0)]  <  0 

which  simplifies  to  give 

Ef£(9)f~2(9)  -  Ef£(9)E2f^1(9)  <  0. 

If  this  is  true  for  all  f  (0),  then  the  lower  bound  is  greater  than  one. 
By  lemma  3  we  see  that  we  require  that  f„  (8)  be  always  increasing 
and  f  (8)  (fu  (9)  +  Ef   (9))  always  decreasing  or  vice  versa. 

Calculation  of  [f  (9)(f~1(9)  +  Ef~1(9)  )]   for  9  near  zero  shows 

£■  H  H 

that  the  term  has  the  sign  of  -sin  9[(y1  +  Y2)  C1  +  Y-tYn)  "  Y1Y2^ 

-1' 

while  fR   (9)  has  the  sign  of  sin  9(Y1+Y2)/(1+Y1Y2) • 


For  both  Y-,  •  Y?  >   0  and  neither  particularly  small,  these  signs 

will  be  the  same.   For  y  and  Y2  <  °»  the  situation  will  be  identical 
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for  6  =  ir.  Thus  for  none  of  the  six  types  of  second  order  process, 

is  the  lower  bound  greater  than  unity.  Only  when  one  of  the 

roots  is  very  small  is  it  possible  that  ALS  is  uniformly  superior  to  OLS, 

III.  Typical  Efficiency  Levels 

In  the  face  of  these  pessimistic  results  for  ALS  estimation, 
we  must  ask  what  efficiency  levels  are  reasonable  in  standard 
econometric  practice,  rather  than  whether  the  lower  bound  is  greater 
or  less  than  unity.   Grenander  [ 3  ]  and  Grenander  and  Rosenblatt  [ 4  ] 
established  that  a  general  condition  for  OLS  to  be  fully  efficient 
is  that,  the  spectrum  of  the  disturbance  must  be  flat  at  all  points 
where  the  exogenous  variable  has  spectral  weight.   For  economic 
times  series,  the  exogenous  variables  typically,  (See  Granger  [  2  ]), 
have  their  spectral  weight  at  low  frequencies  and  OLS  will  be 
efficient  if  the  disturbance  spectrum  is  flat  at  low  frequencies. 
Malinvaud  [ 6  ]  suggests  as  a  rule  of  thumb  that  only  if  the  first 
order  serial  correlation  "considerably  exceeds"  0.5  should 
we  look  for  an  estimator  other  than  OLS.   Consequently,  moving 
from  OLS  to  ALS  which  is  itself  an  inefficient  estimator,  should 
not  yield  much  of  a  gain  in  efficiency  unless  serial  correlation 
is  substantial. 

In  order  to  explore  the  typical  situation  we  will  examine 
the  efficiency  of  ALS  relative  to  OLS  for  exogenous  variables 
x^hich  have  their  entire  spectral  weight  in  the  lower  part  of  the 
spectrum.   Returning  to  the  classifications  in  Figure  1,  it  can 
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be  shown  using  lemmas  3  and  4  that  for  processes  (b) , 
(e)and(f^ALS  will  always  be  superior  if  the  spectral  weight 
of  X  is  entirely  in  the  first  half  of  the  diagram.  On  the  other 
hand,  for  processes  (c)  and  (d) ,  OLS  will  dominate  if  the  spectrum 
of  X  falls  entirely  below  the  point  of  the  minimum  in  (c)  or  max 
in  (d)   (actually  the  point  at  which  the  division  occurs  is 
generally  somewhat  beyond  this  critical  point) .  Thus  we  are 
left  with  only  case  (a)  which  is  uncertain. 

We  established  in  the  previous  section,  that  for  case  a, 
OLS  would  be  superior  in  the  neighborhood  of  the  origin. 
The  percentage  of  the  spectrum  over  which  this  condition  is  valid 
was  computed    numerically  for  several  values  of  the  real  positive 
autoregressive  parameters  and  for  the  complex  roots  for  which 
the  imaginary  part  is  so  small  that  the  spectrum  has  no  turning  point. 
These  precentages  are  shown  in  tables  1  and  2. 


The  value  of  Ef   (9)  clearly  depends  on  waht  X  is  being  considered, 

-1 
Since  the  emphasis  is  on  low  frequency  X's  the  value  of  f   (0)  was  used 

throughout.   This  has  the  effect  of  slightly  extending     the  range  of 

the  condition  supporting  OLS. 
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TABLE  1 

Percent  of  spectrum  starting  at  origin 

for  which  OLS  is  more  efficient  than  ALS. 

Real  Positive  Autoregressive  Root3 


Yl= 
Y2=0.0 
0.1 
0.3 
0.6 
0.7 
0.8 
0.9 
0.95 
0.98 
0.99 


0.0 

0.1 

0.3 

0.6 

0.7 

0.8 

0.9 

0.95 

0.98 

0.99 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

0 

12 

16 

14 

12 

8 

6 

4 

2 

14 

12 

10 

7 

4 

3 

2 

10 

8 

5 

4 

2 

1 

6 

4 
3 

3 
2 

1 

2 
1 
1 
0 

1 

1 
0 
0 
0 

Period 


TABLE  2 

Percent  of  spectrum,  starting  at  origin, 
for  which  OLS  is  more  efficient  than  ALS. 
Complex  autoregressive  Rootsl 


00 

20 

10 

6.667 

5 

4 

3.33 

2.22 

M=.l 

0 

0 

0 

0 

21  ! 

50 

0 

0 

.2 

0 

0 

0 

21 

f  ~35  " 

50 

0 

0 

.3 

12 

15 

21 

29 

1   38 

50 

0 

0 

.4 

18 

20 

24 

i — si  — 

39 

50 

0 

0 

.5 

17 

19 

24 

I    31 

39 

50 

0 

0 

.6 

14 

16 

1 

23 

30 

39 

50 

0 

0 

.7 

10 

14 

_) 

21 

30 

39 

50 

0 

0 

.8 

6 

!"Tl" 

20 

30 

39 

50 

0 

0 

.9 

3 

1     io 

19 

29 

39 

50 

0 

0 

.98 

0 

I    9 

19 

29 

39 

50 

0 

0 

1/  Cases  below  dashed  line  are  of  type  (d) ,  those  above  are  (a). 
Periods  shorter  than  4  are  symmetrically  types  (e)  and  (f). 
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The  conclusions  to  be  drawn  from  these  tables  are  rather  mixed. 
It  is  unreasonable  to  expect  that  any  exogenous  variable  will 
have  its  spectral  weight  entirely  in  the  first  few  percent  of  the 
spectrum.   Thus,  for  large  correlations,  we  would  expect  OLS  to  be 
inefficient.  For  intermediate  values  of  the  autoregressive  parameters, 
the  percentages  are  higher, suggesting  that  for  plausible  series, 
OLS  may  definitely  be  superior.   However,  as  we  observed  before; 
when  at  least  one  parameter  is  small,  ALS  becomes  the  optimum 
estimating  procedure  again. 

In  order  to  clear  the  ambiguity  of  these  results  ve  examined 
the  efficiency  of  ALS  versus  OLS  for  a  specific  exogenous  variable 
which  has  the  typical  spectral  shape  and  which  appears  to  flatten 
the  spectrum  of  many  economic  time  series.   The  process  was 
assumed  to  be  second  order  with  both  roots  equal  to  .8. 
The  relative  efficiencies  of  the  estimators  in  this  environment 
are  presented  in  tables  3  and  4  for  the  same  cases  as  in  1 
and  2. 
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TABLE  3 

Relative  efficiency  of  ALS  vs.  OLS  when  exogenous 
variable  is  second  order  with  roots  both  .8. 
Real  Positive  Autoregressive  Roots 


0.0 

0.1 

0.3 

0.6 

0.7 

0.8 

0.9 

0.95 

0.98 

0.99 

0.0 

1.0 

1.0 

1.01 

1.07 

1.17 

1.48 

2.89 

6.58 

19.87 

54.18 

0.1 

1.00 

1.01 

1.08 

1.19 

1.53 

3.02 

7.08 

21. 

58.52 

0.3 

1.20 

1.09 

1.22 

1.64 

3.56 

8.39 

25.51 

69.63 

0.6 

1.24 

1.53 

2.36 

5.39 

12.74 

38.68 

105.7 

0.7 

1.95 

3.01 

6.98 

16.40 

14.80 

136.4 

0.8 

4.63 

10.55 

24.63 

75.19 

207.2 

0.9 

23.56 

55.51 

175.6 

497.5 

0.95 

135.9 

467.4 

1409. 

0.98 

1949. 

6685. 

0.99 

24508. 

M«. 


TABLE  4 

Efficiency  of  ALS  relative  to  OLS  when 
exogenous  variable  is  second  order  with  both  roots  .8, 

Complex  Roots  ^ 


Period 


CO 

20 

10 

6.667 

5 

4 

3.33 

2.22 

1 

1.002 

1.001 

1.001 

1.000 

1.000 

1.000 

1.000 

1.001 

2 

1.008 

1.007 

1.004 

1.001 

j    I. 000 

1.000 

1.001 

1.004 

3 

1.020 

1.015 

1.006 

1.000 

.999 

1.000 

1.002 

1.010 

4 

1.034 

1.018 

.997 

.993 

.996 

1.000 

1.004 

1.019 

5 

1.062 

.999 

.957 

.971 

.989 

1.000 

1.007 

1.036 

6 

1.237 

.979 

1       .855 

.926 

.977 

1.000 

1.012 

1.071 

7 

1.953 

1.008 

!       .698 

.845 

.957 

1.000 

1.019 

1.149 

8 

4.632 

"  1.T527 

.494 

.714 

.920 

1.000 

1.033 

1.343 

9 

23.56 

.811 

.271 

.501 

.832 

1.000 

1.073 

1.951 

98 

1949. 

,        .379 

.092 

.204 

.556 

1.000 

1.338 

6.591 

1/  See  note  to  table  2. 
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These  tables  give  two  results.  First,  for  type  (a)  processes 
the  efficiency  never  falls  below  .96  although  it  rises  to  several 
thousand  for  very  high  autocorrelations.   Secondly,  however,  for 
moderate  values  of  the  parameters  (i.e.,  first  order  serial  correlation 
less  than  .6)  the  maximum  of  the  efficiency  of  ALS  is  1.07.   Thus, 
while  typical  values  for  the  efficiency  under  case  (a)  suggest  that  ALS 
may  be  preferable,  the  gain  in  efficiency  may  not  be  worth  the 
effort  of  using  another  estimator.   This  result  is  radically 
changed  if  the  values  of  the  parameters  indicate  pronounced 
serial  correlation,  in  which  case,  ALS  is  far  superior  to  OLS. 

In  conclusion, for  underlying  second  order  processes  of  type 
(a),  (b),  (e)  and  (f ) ,  ALS  is  the  recommended  procedure  while 
for  types  (c)  and  (d)  ,0LS  should  generally  give  a  superior  estimate. 
In  practice,  the  characteristics  of  the  true  distrubance  process 
could  be  determined  by  estimating  the  equation  by  OLS  and  then 
computing  the  first  order  serial  correlation  coefficient.  If  its  size  is  sig- 
nificant, spectral  density  function  of  the  residuals  would  then  be  estimated, 
since  asymptotically  it  is  the  spectrum  of  the  disturbance, 
and  thus  the  process  would   be  roughly  classified  into  one  of  the 
six  types. 

IV.   Conclusion 

We  have  examined  the  asymptotic  efficiency  of  estimators  which 
assume  a  first  order  markov  process  for  the  disturbance  relative  to 
ordinary  least  squares,  when  the  true  disturbance  is  instead  a  second 
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order  markov  process.   The  efficiency  depends  on  the  exogenous 
variable  in  the  equation  and  it  is  shown  that  only  when  the 
second  order  process  has  one  root  near  zero,  is  ALS  necessarily  more 
efficient.  However,  for  econometric  applications,  we  can  look  speci- 
fically at  the  low  frequency  segment  of  the  spectrum.  Separating 
the  types  of  spectra  for  the  second  order  process  into  six 
general  shapes,  we  can  then  choose  which  estimator  is  likely 
to  be  superior.   If  the  spectrum  of  the  disturbance  has  an 
overall  slope  which  is  opposite  to  the  slope  at  low  frequencies, 
then  OLS  should  be  superior;  otherwise,  ALS  is  recommended. 
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